Multi-species grandcanonical models for networks with reciprocity 
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Reciprocity is a second-order correlation that has been recently detected in all real directed net- 
works and shown to have a crucial effect on the dynamical processes taking place on them. However, 

(N . 

no current theoretical model generates networks with this nontrivial property. Here we propose a 
grandcanonical class of models reproducing the observed patterns of reciprocity by regarding single 
and double links as Fermi particles of different 'chemical species' governed by the corresponding 
chemical potentials. Within this framework we find interesting special cases such as the extensions 
of random graphs, the configuration model and hidden-variable models. Our theoretical predictions 
are also in excellent agreement with the empirical results for networks with well studied reciprocity. 
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The topological properties of networks are known to 
affect crucially the outcomes of dynamical processes tak- 
ing place on them [1,2]. A particularly important role is 
played by the (second-order) correlations between vertex 
degrees, which have strong effects on many processes in- 
cluding percolation and epidemic spreading [2] . Directed 
networks have been recently shown [3] to display an ad- 
ditional type of second-order correlation: the nonrandom 
presence of mutual links between vertices, or reciprocity. 
Nontrivial reciprocity is found in all real networks, and 
provides new insights into their topology [3]. Moreover, 
reciprocity was recently shown to change dramatically 
the properties of percolation [4] and epidemic spreading 
[5], and that these unexpected dynamical properties are 
triggered even by a small fraction of bidirectional links 
[4]. However, despite its ubiquity and relevance, reci- 
procity is currently not reproduced by any model. 

In this Letter we introduce a general theory for net- 
works displaying nontrivial reciprocity by extending vari- 
ous important models (including random graphs [1,2], the 
configuration model [6-8] and the whole class of hidden- 
variable models [9,10]). Since all these models can be 
obtained as particular cases of the general class of 'ex- 
ponential models' [11], a unifying approach is to extend 
the latter to include reciprocity, so that all the particular 
cases are automatically modified accordingly. Therefore 
we first reformulate the standard results for the exponen- 
tial model defined by the 'graph Hamiltonian' [11] 
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where = 1 if a link from i to j is there (and oth- 
erwise), and is the 'energy' (or 'cost') of such a link. 
The grand partition function and grand potential read 



fl = — In Z = — In Zij — Clij 



a -=Y[Z tJ (2) 



i<j 



(3) 



where pi is the chemical potential and 
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We note that pi is not considered explicitly in the litera- 
ture [11] since its role can be played by an additional con- 
stant term in H . However, since in what follows we shall 
introduce more 'chemical species', we prefer to adopt the 
inverse strategy to keep pi and reabsorbe any constant en- 
ergy term into it (this point will be made clearer below). 
This also allows us to obtain many expected topological 
properties as derivatives of f2 with respect to pi. For in- 
stance, the probability pij of a directed link from i to j 
and the expected number (L) of directed links read 



Pij=( a ij} : 
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dpi 
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Many static models can be recovered as particular 
cases of this formalism [11]. For instance, the case = e 
is the directed version of the random graph model: 



H = e ay = eL 

i^j 
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Pij = P 



1 + e-^ 
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where we have reabsorbed e in a redefinition of pi. An- 
other interesting case is the additive one e,y = a, + p3j, 
which corresponds to the grandcanonical version [8] of 
the directed configuration model [6,7]: 
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where we have introduced the 'fugacity' z = e M and the 
'fitness values' Xi = e~ ai , yj = e~ /3j . We finally note 
that, while the above case corresponds to the choice Cij = 
— ln(xjj/j), the general form = e(xi,yj) is equivalent 
to the whole class of (directed) hidden-variable mod- 
els [9,10] defined by the corresponding fitness-dependent 
probability p(xi,yj). Therefore all the most relevant 
static network models can be recovered as particular 
cases of eq.(l). However, in all such cases the expected 
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reciprocity is trivial, since the probability of having a link 
from i to j is independent on the probability of having 
the reciprocal link from j to i [3]. In other words, if we 
write the reciprocity r and its random value r ran d [3] as 



r = 



id = a = 



N(N - 1) 



(8) 



(where L H = a ij a ji is the number of reciprocated 

links), all the above models display the trivial expected 
value (r) = {r ran d)- The only way to have a nontrivial 
value of (r) is adding an extra term to eq.(l): 



H 



E 

^3 



+ H' 



(9) 



This choice defines the reciprocity model first proposed 
in ref. [12] and recently studied analytically by Park and 
Newman [11] in the case = e by treating H' as a 
perturbation. They found that in this particular case the 
perturbation expansion can be resummed to all orders to 
give an exact expression for Z [11]. In our notation with 
e absorbed in fi, the final expression for (r) is 
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and (r) ^ (r ra nd) whenever A ^ [11]. Therefore the 
reciprocity of this model can be tuned to any desired 
value, however other fundamental topological properties 
(such as scale-free behaviour) discovered more recently 
are not reproduced. Moreover, the perturbative approach 
is analytically complicated and yields exact results only 
in the particular case described above. 

A non-perturbative theoretical model which repro- 
duces all the relevant topological properties including 
the reciprocity is therefore missing. We now define a 
general class of such models by regarding reciprocated 
and non-reciprocated links as different 'chemical species', 
each governed by the corresponding chemical potential. 
In particular, we consider each pair of vertices i, j only 
once (say, with i < j) and regard a non-reciprocated link 
from i to j as a 'particle' of the chemical species labeled 
by the symbol (— a non- reciprocated link from j to i 
as a particle of species (<— ), and two mutual links be- 
tween i and j as a single particle of type (<-►). We denote 
the numbers of particles of such chemical species by n~* , 
and respectively, and the corresponding chemical 
potentials by pt~ and //~\ The number of recipro- 
cated links is therefore L*^ = 2n^, and the number of 
non-reciprocated links is L — = + n* - , so that 
L = + + 2n^ . Our formalism corresponds to the 
decomposition of any directed graph with adjacency ma- 
trix dij into three distinct graphs, with adjacency matri- 



ces a„- 



/(1-aji), ajj = cijiil-aij) and a£ 



We can now generalize the Hamiltonian defined in eq.(l) 
to the case with three chemical species: 
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The grand partition function is now 
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where we have defined the vertex-pair partition function 



Zn = 1 + e 
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Note that, when exchanging sums and products in 
eq.(12), we have replaced the sum over the configurations 
{ a ij}> { a ij}i i a ij} with a sum over the allowed states 
(0,7,07,0,7) - {(0,0,0), (0,0,1), (0,1,0), (1,0,0)}, 
nonzero adjacency matrix elements being mutually ex- 
cluding. The grand potential is 
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where flij = — In Zij. Our model is completely defined. 
For each unrepeated pair of vertices i < j, the proba- 
bilities of having a non-reciprocated link from i to j, a 
non-reciprocated link from j to i, two reciprocated links 
between i and j, or no link at all are given by 
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respectively. Note that formally eqs. (15-18) are unde- 
fined for i > j. However, since (— ►) and (<— ) are actually 
the same chemical species which has been 'split' in order 
to consider each pair of vertices only once, we require 
£>7 = Pji f° r i > 3- Similarly, we require p\* = p^* and 
Pij = Pji ■ T ^is is realized by setting pT* = fi*~ , ejj = 
and eX* = e ji l° r i> j- Thus it is enough to specify p~* 
and pXj to define the model completely. We can write 
the ordinary (unconditional) probability p^ and the con- 
ditional probability introduced in ref. [3] explicitly as 
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and the expected values (r) and (r rand ) can be ob- 
tained [3] cither as the average values (r ran d) = 
E l# , Pij/N(N- 1) and (r) = ^ n 3 /N(N- 1) or from 

< n > = (n > = (n } = (21) 
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Note that (r) = (r ran d) if = pij. Unlike the model 
defined in eq.(9), in our multi-species formalism all the 
terms of the Hamiltonian are equally important, with 
no 'perturbations'. As a consequence, our results are 
obtained in a non-perturbative way and are exact for 
all choices of H. This remarkable advantage allows to 
perform otherwise complicated analytical calculations in 
a very simple and direct way. 

Now we consider various special cases of our model. 



The simplest choice ejj = and e\ 



yields 
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.i" (22) FIG. 1. Possible pairs of graphs in the statistical ensemble. 



which is the reciprocity model defined by eq.(9) in the 
case eij — e with the identification = e and = 
2e — A. After and are reabsorbed in and jj,^ , 
this identification becomes ^ = n and ^ = 2/i + A, or 

IjT = 2yT + A (23) 

and we expect to recover eq.(10) through it. We find 



P 



1 + 2e^ + e^" 

(r) = ra = 
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(24) 
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and eq.(25) is indeed equivalent to eq.(10) through 
eq.(23). Therefore we recover the results by Park and 
Newman [11] much more directly, and we can also gener- 
alize them immediately to more realistic Hamiltonians. 



Another case is the additive choice e„- 



-fi + 7j. If we define the non-reciprocated out- and in- 
degrees k~* , k^~ and the reciprocated degree k^* [4,5] as 



K = Y % ^ = Y % ^ = Y ^ ( 26 ) 

3 3 3 

then we can rewrite the Hamiltonian as 



H = Y( a i k ^ + & fc r + n k D 



(27) 



Equation (27) should be compared with eq.(7). While 
in the 'ordinary' configuration model [6,7] the degree se- 
quences {k} n }, {k° ut } appear in H and are preserved 
while higher-order properties are randomized, here the 
same happens for the three degree sequences {k~*}, {k*~} 
and {kp}. We can therefore denote this case as the 
configuration model with reciprocity. The difference in 
terms of the statistical weight of graphs in the ensem- 
ble is shown in fig.l. The graphs G\ and G2 have the 
same {k™ 1 } and {k° ut } and the same {k~*}, {k*~} and 
{k^*}, and are equiprobable in both models. The same 
occurs for G3 and G4. By contrast, G5 and Gq have the 
same {fcj™}, {k° ut } but different {kf}, {kf} and {kf}. 
Therefore in the ordinary configuration model they are 
equiprobable, while in our model they are not. Trans- 
forming Gi into G2 and G3 into G4 (but not G5 into Gq) 



can also be considered as the allowed generalizations of 
the 'local rewiring algorithm' [7] randomizing a network 
to detect higher-order correlations. Here the reciprocity 
is preserved while randomizing the network. This is pos- 
sible since we have two fugacities 
and three fitness variables Xi = e~ ai 
e~ 7i determining the probabilities 
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(28) 
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and governing separately the various expected degrees 

( fc D = Y p ^ ( fc r) = Y p ^ ( fc D = Y p % ( 31 ) 
3 3 3 

The possibility to control the above degrees indepen- 
dently of each other is a remarkable advantage of our 
model. Even if all real directed networks display a non- 
trivial reciprocity structure [3], the modeling of dynam- 
ical processes is mostly performed on purely directed or 
purely undirected networks [1,2]. However, it has been 
recently shown that the dynamics of percolation [4] and 
epidemic spreading [5] crucially depends on the degree 
sequencies {fcp}, {k^~} and {k^}, its general properties 
being different from the simpler behaviour studied on 
purely undirected networks (where kp = fcp = Vi) or 
purely directed networks (where kX* = Vi). It has also 
been shown that on scale-free networks bidirectional links 
act as 'percolation catalysts' [4], since even an infinitely 
small fraction of them determines a phase transition with 
the onset of a giant strongly connected component. Thus 
our model provides a way to generate random networks 
with an explicit reciprocity structure, where dynamical 
processes can be studied more realistically. 

The most general case with arbitrary fitness-dependent 
probabilities p^J = p^{x i ,y j ), p£ = p"(wi,Wj) repre- 
sents the hidden variable model with reciprocity. Each 
vertex is now characterized by three quantities x, y, z 
determining its expected degrees through eqs.(31). As a 
first example, we consider Xi — yi — Wi Vi and 
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(32) (A^B) + (C^D) = (A^B) + (B^D) + (A^C) (37) 



where z~* and z*~* are defined as above. This model re- 
produces perfectly all the topological properties of the 
World Trade Web (WTW), where xi is identified with the 
total Gross Domestic Product of each country i [3,13,14]. 
To see this, note that in this case the conditional prob- 
ability (20) turns out to be constant and equal to the 
reciprocity r of the network, which is an empirical prop- 
erty observed in each snapshot of the real WTW [3,14]: 
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Tij r 1 + e (f- 
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Then note that if we regard the network as undirected by 
drawing an undirected link between two vertices i and j 
if they are connected by at least one directed link in any 
direction, the probability of such an undirected link is 



Hi = Pi 



- Pi 



Pi 



(2z^ + z^)xiXj 
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(34) 



and the above expression is exactly the one which in ref. 
[13] we showed to reproduce all the topological proper- 
ties of the undirected WTW. In other words, eqs.(32) 
describe completely the topology of the WTW, including 
its reciprocity. We recall that, having the form of a Fermi 
function, « 1 for large XiXj, which implies the 'quan- 
tum effect' [8] that the WTW is not scale-free even if x is 
empirically found to be power-law distributed [13]. Note 
that this model can also be obtained from eq.(27) set- 
ting a.i = fa = 7i and introducing the undirected degree 
ki measured on the undirected version of the graph: 



'■i ki 



(35) 



Our second example concerns shareholding networks 
(SN). In ref. [15] it has been shown that the 'ordinary' 
hidden-variable model successfully reproduces the prop- 
erties of SN if the unconditional probability has the form 
Pij = P( x ii Vj) = Uj f( x i) where yj is the wealth invested 
by the agent j and Xi is the information associated to the 
asset i. On the other hand, in ref. [3] we showed that the 
SN for NYSE and NASDAQ have no reciprocated links, a 
property not reproduced by the above form for . Our 
model reproduces all these properties by setting 



Pi 



= y?f(xi) 



p,7 = o 



(36) 



We recall that, unlike the WTW, SN are in the 'classical 
limit' where the empirical power-law distribution of y is 
reflected in a scale- free degree distribution [15]. 

We finally propose an interpretation of eq.(23) in terms 
of a 'chemical reaction' converting the chemical species 
(— >), (<— ) and (<->) into each other. Let us first consider 
our system when A = 0. Since in this case the graphs 
G5 and Gq in fig.l have the same statistical weight, 
their 'particles' must be connected through the follow- 
ing chemical reaction which is at equilibrium: 



The condition for equilibrium is obtained by replacing 
in the above expression each chemical species with its 
chemical potential, which gives /i" = 2/i - *, consistently 
with eq.(23) since A = 0. When A ^ the graphs G$ 
and Gq have different statistical weights, meaning that 
the above chemical reaction occurs with the release of 
an additional 'energy' A such that fi*^ = 2/i - * + A as in 
eq.(23). When A > the reaction is 'esothermic' and the 
production of reciprocated links is energetically favoured, 
while when A < the reaction is 'endothermic' and the 
production of reciprocated links is suppressed. 

We have introduced the first theoretical model repro- 
ducing the nontrivial properties of real networks includ- 
ing their reciprocity. Our results provide an improved 
characterization of the network topology and a basis for 
the investigation of the effects of reciprocity on network 
dynamics. These ideas, inspired by the Fermi statistics 
of multispecies systems, can be directly generalized to 
networks with different types of links (in preparation). 
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